High-Dimensional Sparse Econometric Models, an Introduction

نویسنده

  • Alexandre Belloni
چکیده

In this chapter we discuss conceptually high dimensional sparse econometric models as well as estimation of these models using l1-penalization and postl1-penalization methods. Focusing on linear and nonparametric regression frameworks, we discuss various econometric examples, present basic theoretical results, and illustrate the concepts and methods with Monte Carlo simulations and an empirical application. In the application, we examine and confirm the empirical validity of the Solow-Swan model for international economic growth. 1 The High Dimensional Sparse Econometric Model We consider linear, high dimensional sparse (HDS) regression models in econometrics. The HDS regression model has a large number of regressors p, possibly much larger than the sample size n, but only a relatively small number s < n of these regressors are important for capturing accurately the main features of the regression function. The latter assumption makes it possible to estimate these models effectively by searching for approximately the right set of the regressors, using l1-based penalization methods. In this chapter we will review the basic theoretical properties of these procedures, established in the works of [8, 10, 18, 17, 7, 15, 13, 27, 26], among others (see [20, 7] for a detailed literature review). In this section, we review the modeling foundations as well as motivating examples for these procedures, with emphasis on applications in econometrics. Let us first consider an exact or parametric HDS regression model, namely, Alexandre Belloni Duke University, Fuqua School of Business, 100 Fuqua Drive, Durham, NC, e-mail: [email protected] Victor Chernozhukov Massachusetts Institute of Technology, Department of Economics, 50 Memorial Drive, Cambridge, MA e-mail: [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inference for high-dimensional sparse econometric models

This article is about estimation and inference methods for high dimensional sparse (HDS) regression models in econometrics. High dimensional sparse models arise in situations where many regressors (or series terms) are available and the regression function is wellapproximated by a parsimonious, yet unknown set of regressors. The latter condition makes it possible to estimate the entire regressi...

متن کامل

Power Enhancement in High Dimensional Cross-Sectional Tests.

We propose a novel technique to boost the power of testing a high-dimensional vector H : θ = 0 against sparse alternatives where the null hypothesis is violated only by a couple of components. Existing tests based on quadratic forms such as the Wald statistic often suffer from low powers due to the accumulation of errors in estimating high-dimensional parameters. More powerful tests for sparse ...

متن کامل

TPS 2013 (1).indd

Modeling of complex systems is commonly confronted with high dimensional set of independent variables. Similarly, econometric models are usually built using time series data that often exhibit nonstationarity due to the impact of some policies and other economic forces. In both cases, linear regression modeling may yield unstable least squares estimates of the regression coeffi cients. Principa...

متن کامل

Robust Estimation in Linear Regression with Molticollinearity and Sparse Models

‎One of the factors affecting the statistical analysis of the data is the presence of outliers‎. ‎The methods which are not affected by the outliers are called robust methods‎. ‎Robust regression methods are robust estimation methods of regression model parameters in the presence of outliers‎. ‎Besides outliers‎, ‎the linear dependency of regressor variables‎, ‎which is called multicollinearity...

متن کامل

Mammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease

Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011